Probably Approximately Optimal Satisficing Strategies
Abstract
A satisficing search problem consists of a set of probabilistic experiments to be performed in some order, seeking a satisfying configuration of successes and failures. The expected cost of the search depends both on the success probabilities of the individual experiments and on the search strategy, which specifies the order in which the experiments are to be performed. A strategy that minimizes the expected cost is optimal. Earlier work has provided "optimizing functions" that compute optimal strategies for certain classes of search problems from the success probabilities of the individual experiments. We extend those results by providing a general model of such strategies, and an algorithm PAO that identifies an approximately optimal strategy when the probability values are not known. The algorithm first estimates the relevant probabilities from a number of trials of each undetermined experiment, and then uses these estimates, and the proper optimizing function, to identify a strategy whose cost is, with high probability, close to optimal. We also show that if the search problem can be formulated as an and-or tree, then the PAO algorithm can also "learn while doing", i.e. gather the necessary statistics while performing the search.
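The estimate-then-optimize idea in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes the simplest problem class, in which independent experiments are run in sequence and the search stops at the first success. For that class, ordering experiments by decreasing ratio of success probability to cost is a known optimal rule. The function names, the cost values, and the fixed trial count are all hypothetical; the abstract does not give the sample-size bound that PAO uses.

```python
import random

def expected_cost(order, probs, costs):
    """Expected cost of running experiments in `order`, stopping at the first success."""
    total, reach = 0.0, 1.0  # reach = probability that the search gets this far
    for i in order:
        total += reach * costs[i]
        reach *= 1.0 - probs[i]
    return total

def optimizing_function(probs, costs):
    """Assumed problem class: order by decreasing success-probability-to-cost ratio."""
    return sorted(range(len(probs)), key=lambda i: probs[i] / costs[i], reverse=True)

def estimate_probs(true_probs, trials, rng):
    """Estimate each success probability from `trials` simulated independent samples."""
    return [sum(rng.random() < p for _ in range(trials)) / trials for p in true_probs]

def pao_sketch(true_probs, costs, trials, seed=0):
    """Estimate the probabilities, then feed the estimates to the optimizing function."""
    rng = random.Random(seed)
    estimates = estimate_probs(true_probs, trials, rng)
    return optimizing_function(estimates, costs)

# Hypothetical example values; the true probabilities are hidden from pao_sketch
# except through sampling.
true_probs = [0.2, 0.7, 0.5]
costs = [1.0, 2.0, 1.0]

best = optimizing_function(true_probs, costs)         # uses the true probabilities
learned = pao_sketch(true_probs, costs, trials=2000)  # uses estimates only
gap = expected_cost(learned, true_probs, costs) - expected_cost(best, true_probs, costs)
print(best, learned, gap)
```

The quantity `gap` is the extra expected cost paid for using estimated rather than true probabilities. The abstract's claim is that, with enough trials, this gap is small with high probability.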
Similar resources
Satisficing Action Selection
It is argued that for real-world applications action selection should be satisficing, i.e. merely "good enough" rather than optimal. It is then demonstrated that multiple objective decision theory provides a suitable framework for formulating action selection mechanisms that are satisficing. A set of experiments demonstrate the potential advantages of the proposed action selection mechanisms.
Nonlinear Robust Regulation Using the Satisficing Paradigm
In this paper we use the recently introduced concept of satisficing decision theory to develop robustly stabilizing state-feedback control strategies for nonlinear systems subject to unknown disturbances. The result is a point-wise "robust satisficing" set of control options from which a continuous feedback strategy can be selected. We show that the set of continuous "robust satisficing" control ...
A Theory of Satisficing Control
The existence of an optimal control policy and the techniques for finding it are grounded fundamentally in a superlative perspective. These techniques can be of limited value when the global behavior of the system is difficult to characterize, as it may be when the system is nonlinear, when the input is constrained, or when only partial information is available regarding system dynamics or the env...
Satisficing Games
Optimization is a mathematical sophistication of the common sense view that decision makers ought to do the best they can, and leads to well known solution concepts for mathematical games. For multi-agent systems in naturalistic settings, however, uncertainty and complexity may subordinate optimal performance to the more moderate goals of functionality and robustness. Moderation is another comm...